21 research outputs found

    An Investigation of Recurrent Neural Architectures for Drug Name Recognition

    Full text link
    Drug name recognition (DNR) is an essential step in the Pharmacovigilance (PV) pipeline. DNR aims to find drug name mentions in unstructured biomedical texts and classify them into predefined categories. State-of-the-art DNR approaches heavily rely on hand crafted features and domain specific resources which are difficult to collect and tune. For this reason, this paper investigates the effectiveness of contemporary recurrent neural architectures - the Elman and Jordan networks and the bidirectional LSTM with CRF decoding - at performing DNR straight from the text. The experimental results achieved on the authoritative SemEval-2013 Task 9.1 benchmarks show that the bidirectional LSTM-CRF ranks closely to highly-dedicated, hand-crafted systems.Comment: Accepted for Oral Presentation at LOUHI 2016 : EMNLP 2016 Workshop - The Seventh International Workshop on Health Text Mining and Information Analysis (LOUHI 2016

    Bidirectional LSTM-CRF for Clinical Concept Extraction

    Full text link
    Automated extraction of concepts from patient clinical records is an essential facilitator of clinical research. For this reason, the 2010 i2b2/VA Natural Language Processing Challenges for Clinical Records introduced a concept extraction task aimed at identifying and classifying concepts into predefined categories (i.e., treatments, tests and problems). State-of-the-art concept extraction approaches heavily rely on handcrafted features and domain-specific resources which are hard to collect and define. For this reason, this paper proposes an alternative, streamlined approach: a recurrent neural network (the bidirectional LSTM with CRF decoding) initialized with general-purpose, off-the-shelf word embeddings. The experimental results achieved on the 2010 i2b2/VA reference corpora using the proposed framework outperform all recent methods and ranks closely to the best submission from the original 2010 i2b2/VA challenge.Comment: This paper "Bidirectional LSTM-CRF for Clinical Concept Extraction" is accepted for short paper presentation at Clinical Natural Language Processing Workshop at COLING 2016 Osaka, Japan. December 11, 201

    Scalable and Interpretable One-class SVMs with Deep Learning and Random Fourier features

    Full text link
    One-class support vector machine (OC-SVM) for a long time has been one of the most effective anomaly detection methods and extensively adopted in both research as well as industrial applications. The biggest issue for OC-SVM is yet the capability to operate with large and high-dimensional datasets due to optimization complexity. Those problems might be mitigated via dimensionality reduction techniques such as manifold learning or autoencoder. However, previous work often treats representation learning and anomaly prediction separately. In this paper, we propose autoencoder based one-class support vector machine (AE-1SVM) that brings OC-SVM, with the aid of random Fourier features to approximate the radial basis kernel, into deep learning context by combining it with a representation learning architecture and jointly exploit stochastic gradient descent to obtain end-to-end training. Interestingly, this also opens up the possible use of gradient-based attribution methods to explain the decision making for anomaly detection, which has ever been challenging as a result of the implicit mappings between the input space and the kernel space. To the best of our knowledge, this is the first work to study the interpretability of deep learning in anomaly detection. We evaluate our method on a wide range of unsupervised anomaly detection tasks in which our end-to-end training architecture achieves a performance significantly better than the previous work using separate training.Comment: Accepted at European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD) 201

    Image Anomaly Detection with Capsule Networks and Imbalanced Datasets

    No full text
    Image anomaly detection consists in finding images with anomalous, unusual patterns with respect to a set of normal data. Anomaly detection can be applied to several fields and has numerous practical applications, e.g. in industrial inspection, medical imaging, security enforcement, etc.. However, anomaly detection techniques often still rely on traditional approaches such as one-class Support Vector Machines, while the topic has not been fully developed yet in the context of modern deep learning approaches. In this paper we propose an image anomaly detection system based on capsule networks under the assumption that anomalous data are available for training but their amount is scarce

    ON THE ANALYSIS OF THE PERFORMANCES OF PARTICLE SWARM OPTIMIZATION ALGORITHM WITH GLOBALLY AND LOCALLY TUNED INERTIA WEIGHT VARIANTS

    No full text
    The investigation of the performance of Particle Swarm Optimization (PSO) algorithm with the new variants to inertia weight in computing the optimal control of a single stage hybrid system is presented in this paper. Three new variants for inertia weight are defined and their applicability with the PSO algorithm is thoroughly explained. The results obtained through the new proposed methods are compared with the existing PSO algorithm, which has a time varying inertia weight from a higher value to a lower value. The proposed methods provide both faster convergence and optimal solution with better accuracy

    Photoacoustic spectroscopy of sprayed CuGaxIn1−xSe2 thin films

    No full text
    CuGaxIn1−xSe2 thin films have been produced using spray-pyrolysis. The layers were deposited onto Mo-coated soda-lime glass at a substrate temperature of 623 K, and the gallium content of the layers varied. The layers were investigated using photoacoustic spectroscopy. The normalised photoacoustic spectra of CuGaxIn1−xSe2 films with x=0, 0.2, 0.4 and 0.5 showed a sudden fall in the photoacoustic signal at photon energies of 0.99, 1.15, 1.27 and 1.36 eV, respectively, which correspond to the respective direct energy band gaps of these films. The photoacoustic spectra also showed peaks at 0.75, 0.78, 0.82, 0.86 and 0.88 eV, respectively, in the sub-band gap region, which have been attributed to the defect states within these films

    Semantic Indexing of Multimedia Content Using Visual, Audio, and Text Cues

    No full text
    We present a learning-based approach to the semantic indexing of multimedia content using cues derived from audio, visual, and text features. We approach the problem by developing a set of statistical models for a predefined lexicon. Novel concepts are then mapped in terms of the concepts in the lexicon. To achieve robust detection of concepts, we exploit features from multiple modalities, namely, audio, video, and text. Concept representations are modeled using Gaussian mixture models (GMM), hidden Markov models (HMM), and support vector machines (SVM). Models such as Bayesian networks and SVMs are used in a late-fusion approach to model concepts that are not explicitly modeled in terms of features. Our experiments indicate promise in the proposed classification and fusion methodologies: our proposed fusion scheme achieves more than 10 % relative improvement over the best unimodal concept detector

    Rapid-Learning System for Cancer Care

    No full text
    Compelling public interest is propelling national efforts to advance the evidence base for cancer treatment and control measures and to transform the way in which evidence is aggregated and applied. Substantial investments in health information technology, comparative effectiveness research, health care quality and value, and personalized medicine support these efforts and have resulted in considerable progress to date. An emerging initiative, and one that integrates these converging approaches to improving health care, is “rapid-learning health care.” In this framework, routinely collected real-time clinical data drive the process of scientific discovery, which becomes a natural outgrowth of patient care. To better understand the state of the rapid-learning health care model and its potential implications for oncology, the National Cancer Policy Forum of the Institute of Medicine held a workshop entitled “A Foundation for Evidence-Driven Practice: A Rapid-Learning System for Cancer Care” in October 2009. Participants examined the elements of a rapid-learning system for cancer, including registries and databases, emerging information technology, patient-centered and -driven clinical decision support, patient engagement, culture change, clinical practice guidelines, point-of-care needs in clinical oncology, and federal policy issues and implications. This Special Article reviews the activities of the workshop and sets the stage to move from vision to action
    corecore